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Abstract — In this paper, we show that good structured codes 
over non-Abelian groups do exist. Specifically, we construct codes 
over the smallest non-Abelian group De and show that the 
performance of these codes is superior to the performance of 
Abelian group codes of the same alphabet size. This promises 
the possibility of using non-Abelian codes for multi-terminal 
settings where the structure of the code can be exploited to gain 
performance. 

Index Terms — group codes, structured codes, achievable rate, 
non-Abelian groups 



I. Introduction 

ALGEBRAICALLY structured codes are an important 
class of codes in coding/information theory and commu- 
nications and evaluating the information-theoretic performance 
limits of such codes has been an area of significance |lj- 
Q. It is well-known that linear codes achieve the symmetric 
capacity of g-ary channels where g is a prime ||7j ||6). Linear 
codes can also be used to compress a binary source losslessly 
down to its entropy 1 8 1 . Optimality of linear codes for certain 
communication problems motivates the study of algebraic- 
structured codes including Abelian and non-Abelian group 
codes. 

In (|8| it has been shown that for some multi-terminal 
communication settings, the average asymptotic performance 
of the ensemble of structured codes can be better than that of 
random codes. In recent years, such gains have been shown 
for a wide class of multi-termianl problems fS], f9^, pO) . 
Thus, characterization of the information theoretic perfor- 
mance limits of these codes became important. However, the 
structure of the code restricts the encoder to abide by certain 
algebraic rules. This causes the performance of such codes to 
be inferior to random codes in some communication settings. 
Linear codes are highly structured and for some problems in 
information theory they cannot be optimal. Moreover, these 
codes can only be defined over alphabets of size a power of 
a prime. 

Group codes are a generalization of linear codes which are 
algebraically structured and can be defined for any alphabet. 
These codes can outperform unstructured codes in certain 
communication problems |5|. Group codes were first studied 
by Slepian for the Gaussian channel. In |12|, the capacity 
of group codes for certain classes of channels has been 
computed. Further results on the capacity of group codes were 
estabhshed in Q, ||T3), g^. 

This work was supported by NSF grants CCF-0915619 and CCF-1 116021. 



It has been shown in |14| that Abelian group codes do not 
achieve the capacity of arbitrary channels. It has also been 
conjectured by several authors that non-Abelian group codes 
are inferior to Abelian group codes p3) p6| |17|. This 



motivates a loosening of the structure of the code yet further 
In this work, we focus on the point-to-point channel coding 
problem. We define a class of structured codes which includes 
the class of group codes and has less structure compared to 
group codes. We evaluate the performance of such codes over 
the smallest non-Abelian group Dg and show that these codes 
have a strictly better performance compared to Abelian group 
codes. We use a combination of algebraic and information- 
theoretic tools for this task. This observation broadens our 
view to structured codes for possible use in multi-terminal 
settings. 

The paper is organized as follows: In Section |Il] we intro- 
duce our notation and develop the required background. In 



Section III we define the ensemble of codes. We analyze the 



performance of theses codes in Section |lVj where we solve an 
optimization problem and make several counting arguments. 
We compare the performance of the constructed codes to the 
performance of Abelian group codes in Section |V] and we 
conclude in Section [Vl] 

II. Preliminaries 

1) Groups: A group is a set G equipped with a binary 
operation "•" to form an algebraic structure. The group opera- 
tion "•" must satisfy the group axioms (closure, associativity, 
identity and invertibility). A group is called Abelian if its 
operation is commutative and non-Abelian otherwise. 

2) Group Codes: Given a group G, a group code C over 
G with block length n is any subgroup of G" |j3], p8) . A 
shifted group code over G, C + u is a translation of a group 
code C by a fixed vector v e G". 

3) Source and Channel Models: We consider discrete mem- 
oryless and stationary channels used without feedback. We 
associate two finite sets X and y with the channel as the 
channel input and output alphabets. These channels can be 
characterized by a conditional probability law T4^(2/|a:) for 
X ^ X and y E y. The set X admits the structure of 
a finite Abelian group G of the same size. The channel is 
specified by {G,y,W). Assuming a perfect source coding 
block applied prior to the channel coding, the source of 
information generates messages over the set {1,2,...,M} 
uniformly. 
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4) Achievability and Capacity: A transmission system with 
parameters (n, M, r) for reliable communication over a given 
channel (G, y, W) consists of an encoding mapping and a 
decoding mapping 

e : {1,2,...,M} G" 
/:y'^{L2,...,M} 



such that for all m 

A/ 



1,2,..., M, 



e(m)) < r 



Given a channel (G, 3^, M^), the rate R is said to be achievable 
if for all e > and for all sufficiently large n, there 
exists a transmission system for reliable communication with 
parameters (n, M, t) such that 

- log M > i? - e, T < e 
n 

The capacity of the channel is defined as the supremum of the 
set of all achievable rates. 

5} Typicality: Consider two random variables X and Y 
with joint probability density function px.y{x^ y) over X xy. 
Let n be an integer and e be a positive real number The 
sequence pair (a;",y") belonging to A"" x 3^" is said to be 
jointly e-typical with respect to px.Y{x,y) if 



Va e A", V6 e 3^ 



7V(a,&|x",2/")-px,y(a,6) 



< 



\x\\y\ 

and none of the pairs (a, 6) with px,Y{a,h) — occurs in 
(a;",?/"). Here, iV(a, 6|a;", y") counts the number of occur- 
rences of the pair (a, &) in the sequence pair {x",y"). We 
denote the set of all jointly e-typical sequence pairs in A"" x y" 
by A^X^Y). 

Given a sequence x" £ A"", the set of conditionally e-typical 
sequences is defined as 



^','(r|a;") = {y" G J" |(a;",2/") € ^^(X,y)} 



(1) 



6J Dihedral Groups: A dihedral group of order 2p is the 
group of symmetries of a regular p-gon, including reflections 
and rotations and any combination of these operations. A 
dihedral group can be represented as a quotient of a free group 
as follows: 



D 



2p 



{x,y\xP = = l^xyxy = 1) 



Dihedral groups are among the simplest non-Abelian groups. 

7) Notation: In our notation, 0(e) is any function of e such 
that lime^o 0(e) = and for a set A, \A\ denotes its size 
(cardinality). 

III. A Class of Structured Codes 

Based on Forney's analysis of group codes [3 |, we construct 
a class of structured codes which we call pseudo-group codes. 

In this note, we consider a class of pseudo-linear codes over 
Dq and find a lower bound on the capacity of such codes. 
Let Ai, - ■ ■ , An be subgroups of G. We propose a method to 
construct a code whose output at time k forms the subgroup 
Ak of G and whose input is the subgroup Fk of Ak'. 



Choose the controllability index of the code v G Z+. 
For each k, choose a normal series Fk.o < ^l-.i <\ ■ ■ ■ < 



F, 



Fk in Fu- 



For each k, define granules T^^^k] 
1 ^ J < define granules T^^.k+j] 
For each k and for all < j 



such that C[k,k+j] < and C[k^k+j]/C\k^k+j^i] ■ 

C[fc+i,fc+j] = ^[k,k+j] where A|f is the projection of 
A^ onto the interval [k, k + j]. In other words, C[k,k+j] 
is isomorphically an extension of C[k,k+j-i] • C![k+i^k+j] 
by T[k,k+j] inside ylj*^'*^^"''. Note that at least one such 
extension exists (direct product) and note that the projec- 



= Ffc and for all 

= Fk,j/Fkj^i. 
< V find Cyk,k^j\ 



tions Gi 



[fc,A;+j-l] 



are defined recursively. 



Choose the mappings Tk : Fk 



G, 



\kM 



such that T 



is a transversal function of the subgroup C\k.k+v-\\ ■ 

C[fe+l,fc + !y] of G[fe^fc + ^]. 

• Given an information digit sequence . . . uoUiU2 ■ ■ ■, Ui £ 
Z, the output sequence is . . . To(uo)Ti(ui)T2(u2) . . .. 

Definition III.l. Any code constructed using the above con- 
struction algorithm is called a pseudo-linear code over G. 

Tlieorem III.l. Any free group code is a pseudo-linear code. 

Proof: It has been shown in [3] that any free group code 
can be reconstructed using the above construction algorithm. 
This completes the proof. ■ 
For Abelian groups, the definition of pseudo-group codes 
coincides with the definition of group codes but for non- 
Abelian groups this class is larger than the class of group 
codes; i.e. it includes all group codes as well as some non- 
group codes. In this paper, we use these codes for the smallest 
non-Abelian group Dg and show that this loosening of the 
structure results in a better performance. The generalization 
of the analysis to dihedral groups where p is a prime is 
relatively straight forward. 

The group Dg with presentation Dg — {x,y\x^ — 
l,y^ = l,xyxy = 1) can be characterized by a set 
{l,x,x^ ,y,xy,x^y} with the following table of operations: 





1 


X 


x' 


y 


xy 


x^y 


1 


1 


X 


x' 


y 


xy 


x^y 


X 


X 


x' 


1 


xy 


x'^y 


y 


x' 


x' 


1 


X 


x^y 


y 


xy 


y 


y 


x'^y 


xy 


1 


x' 


X 


xy 


xy 


y 


x^y 


X 


1 


x' 


9 

x^y 


x^y 


xy 


y 


x' 


X 


1 



Note that for two elements 5, h in Dg, g ■ h may not be 
equal to h ■ g. In the following, we construct the ensemble of 
codes over Dg. Consider codes with input group Fk = Dg 
and output group Ak = Dg for all k. Let ^ be a nonnegative 
integer number We choose the following normal series: 



Fk,o = l<i Fk,i = I <i... = 1 < Ffc^, = Dg = Fk 

(In the most general case we can have a chain isomorphic to: 

i<ii<...<]i<iZ2<]...<iZ2<irog<]...<Dg 
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but it can be shown that this chain can do no better.) 

Since the input and output groups are not changing over time, 

the granules are time independent. Define Tj — Fj^, fe^jj. Then, 

To — Fkfl = C[k.k] = 1 
Ti = Fk^i/Fkfi = 1 



Tjy — -F'fc.jy/^/c.iy-l — IDg 



k\0 



Let . . . uqUiU2 ■ ■ ■ be the information digits where 
Ui = x"'^y''\ then the output of the code is ...cqCiC2-.- 
where c, = gl^^h^::,. ' ' ' 9tXo- 

Assume the input is fed circularly to the code (tail biting). 
We also add a dither to the code. 

Here we give a summary of the resulting ensemble of 
codes. Each code in this ensemble has a rate of i? = - loe:6. 

n o 



Next step is to find the projection of the code over finite 
intervals. Note that in this case we have C[k.k+j] — C'[o,i]- 

C[o,o] — To = 1, C[o,o] < ©6 ^ ^[0,0] = 1 
C[i,i] = C[o,o] = 1 
^ Qia] • C[o,a] = 1 

C[OA]/C[ofi] ■ C[i^i] = Ti = liC'[o,i] < ©6 ^ ^[1,1] = 1 



C'[0,iy-l]/C'[0,I^-2] ■ C'fl,!/-!] — r^_l = l,C[0,,y_l] < iU^Q 

=^ Qo.iz-l] = 1 

C[l,!^] — C'[0,!^-1] = 1 



Therefore, 



qo.H = (g",h"|(g'r = 1, (h")^ = l,g"h"g"h" = 1) (2) 



where g°,h° e ©1^+^ 



{h")' = l^h°e{l,y,xy,x'yr+' 



For i — 1, • • • ,n and = 1, • • • ,k choose gij and hij 
randomly according to Table 1. for ^ 

gi'j', hiijr). 



{gij , hij ) is chosen independently from {gi- 



For i = 1, • ■ ■ choose the dither Bi uniformly ran- 
domly from Dg- 

Given the input sequence u = {ui, ■ ■ ■ ,Uk) where Ui = 
^.ttiybi^ e Z3, bi e Z2 for i = 1, • • • , fc, the output 
sequence is equal to c = (ci, • • • , c„) where 

„ _ „ai „a2 J,t'2 „°-kl,bk r> 



C2 - 521 ^2lff22 "-22 • • • 92k "'2k ' ^2 



— o"! ■ 

— ynl"'niyn2'*n2 

We denote this by c = G(u) • B. 



Jnk'^nk ^" 



(3) 





1 


y 




x'^y 


1 


1 

10 


1 

10 


1 

10 


1 

10 


X 





1 


1 


1 







10 


10 


10 



Fig. 1: gij is chosen from {l^x.x^} and /i^^ is chosen 
from {y, a;j/, x'^y}. The number in the table shows the joint 
probability of (5^, hij) being picked. 



It can be shown that if we take g° = <7oo.9oi • ■ • gof ™d 
h° = hoohoi . . . goi, where goi and /iq; are chosen jointly 
according to Table 1 then the third condition in Equation [2] 
will also be satisfied. Therefore for any such g" and h'', the 
group qo.H - (g°,h0|(g0)3 ^ l,(h0)2 = l,gVgOhO = 1) 
is a subgroup of Dq. As tough our input group is only 
restricted to be a subgroup of Dg and not necessarily Dg 
itself. 

Similarly, We get qk.k+.] = (g", h"|(g^-)' = 1, (h')' = 



l,g'=hVh'' 



1) where g 



5feo5fci • • • 5fei/ and 



h' = hkohki ■ ■ ■ gku and goi's and ft-oi's are chosen 
according to Table 1. 

Note that any element in Dg can be uniquely written as 
x°'yP for some a e Z3 and /3 e Z2. Define the transversal 
functions as: 



We evaluate the performance of these codes using a random 
coding argument in the next section. 

IV. Main Result 

In this section we show the existence of good structured 
codes over the non-Abelian group Dg by proving the following 
theorem: 

Theorem IV.l. For the channel {J)f„y,W), let X be a 
uniform random variable over tlie channel input and let the 
random variable [X] indicate the coset o/{l,a;,x^} in Dg 
where X belongs to. i.e. 



{l,x,x'^} if X e {l.x.x"^} 
{y, xy, x'^y} if X e {y, xy, x'^y} 
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Then the rate R* is achievable using pseudo-group codes over 
Dg where 

/ 1 fi 

R* = min (log, 6 - HiX\Y), ^ [log^ 3 - H{X\[X]Y)] 

The rest of this section is devoted to give a sketch of 
the proof of this theorem. Consider the class of pseudo- 
group codes over Dg of the form ^ used for the channel 
(Dgj y, W). The set of messages is Dg and for each message 
u E Dq the encoder maps it to c e Dg where c = G{u) ■ B. 
At the receiver, after receiving the channel output y G 3^", the 
decoder looks for a message u e such that c = G{u) ■ B 
is jointly e-typical with y with respect to PxWy\x where Px 
is uniform over Dg and e > is arbitrary. If it finds a unique 
such c, it decodes y to u, otherwise it declares error 
The expected value of the average probabiUty of error for this 
coding scheme is given by 

W.{Pa.a{err)} - E ^ E^(^(") ' ^ = c) ^ ^ 



P{G{u) ■ B = 5|G(w) • B ^ c)W{y\c) + 0(e) 



We need to evaluate the conditional probability P{G{u) ■ 
B = c\G{u)-B = c) to proceed. For u,u G Dg' and x,x G Dg, 
let u — {ui,--- ,Uk) where Ui = x°-^y^^ for i = I,-- - ,A: 
and u = [ui, ■ ■ ■ , Uk) where Ui ~ x°-^y''' for i — 1, • • • ,k. 
Also let c = (ci, • • • , c„) and c ~ (ci, • • • , c„) and define 
6* = cc-^ = ifi*™) where 9, = x"'?/^'. Define the 

following: 

N,{c,c) = {ie[i,- 
N2ic,c) = {ie[i,- 
N3{c,c) = {ie[i,- 

Mi{u,u) = |« e [1, 
M2{u,u) = {j e [1, 

M3{u,u) = e [1, 

Also define ni(c, c) = |7Vi(c, c)|, n2(c, c) = |7V2(c, c)|, 
713(0, c) = |7V3(c, c)|, mi(u,'S) = \Mi{u,u)\, m2{u,u) = 
|M2(u,m)|, TO3(u,u) = |M3(u,u)|. 

Lemma IV.l. For u, u e D| ant/ c, c e Dg, if u ^ u, then 

P{G{u) ■ B = 5|G(u) -5 = 0) 



mi ^ 

/ odd 



k = 0, a, ^ 0} 

'i = 0, Q^i = 0} = n — ni — 77-2 
,k]\bi 7^ foij 

= fei,ai = aj|=/s-TOi- m2 



1=2 

I even 



-1 r|fc-r?Ji-m2 ('1 nm2 _ 1 'I ™± /m , \ 



1=2 

I even 



\T„ 



3ml ^ 2™2 

31ml , 2™^ 



Moreover, for a fixed u, let Tm-^^m2 ('^') f^^^ of all u with 
mi[u,u) — mi, m2{u,u) ~ m2, then 

k 

^■mi,m2,m3y 

k \ /k — mi 
^mij \ TO2 
Proof: First Note that 

P {G{u) ■ B = £\G{u) ■ B = c) = 
P {G{u)G-\u) = 9\G{u) ■B = c) = 
P{G{u)G-\u) = 9) 

where the first equality follows since the multiplication oper- 
ation is invertible and the second equality follows since B is 
uniform and independent from other variables. We have 



p{G{u)G-^{u) = e) = 

n 

nP ( o"^^ h''^ o"^ h''^ ■ ■ ■ a°-''h^''h^ 
^ \yil "-11^12 'H2 iJik "'ik"'i. 



k9ik 



Hence, we first find the ith probability in this expression for 
some arbitrary i e 1, 2, • • • , n. Consider the case where Pi = 
1. Since the ensemble has a uniform distribution, we need to 
count the number of g^j 's and /i^ 's such that the equality 



i/il "■iiyi2 "'12 



ak ibk ,bk -ak 
'Jik "'ik"'ikilik 



(4) 



is satisfied and divide this number to the total number of 
choices. Use the equality yx — x^y to argue that the power of 
y on the left hand side of this expression adds. i.e. the power 
of y on the left hand side is equal to the sum of the powers 
of y terms appearing in the expression. This is equal to 



k 

E 



{b,+b,) 



Where the addition is done mod-2. This can be written as 



E 

jeMi(c,c) 

hije{y,xy.x'^y} 



(bj+bj) = |{j e Mi\hij e {y,xy,x^y}}\ 



Let L C Ml be the set of indices j € Mi where hij G 
{y, xy, x'^y}. Since the power of y on the right hand side is 
equal to one, the cardinality of L must be odd. We count 
the number of solutions of Q as follows: Let L C Mi be 
arbitrary with an odd cardinality. For j ^ Mi choose gij and 
hij ai-bitrarily (10'''^™i choices), for j e Mi\L let gij = 1 
and hij = 1 (1 choice). Since the cardinality of L is assumed 
to be odd, it should have at least one element (say j*). for j e 
L\{j*} choose gij from {l,x,x^} and hij from {y,xy,x^y} 
arbitrarily (9'~^ choices where I — \L\). Also choose gij- 
arbitrarily from {l,x,x^} (3 choices). After moving terms to 
the other side we will get the following expression 

t _ ^some power some power 

Note that, by construction, the power of y on the right hand 
side has to be equal to y and hence hij<. has a unique solution 
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in {y,xy,x^y}. Therefore, it turns out that the number of 
choices for the case /3i = 1 is equal to 



Define 



10' 



1=1 
I odd 

Note that if mi = the above expression is defined to be 
zero. 

Now consider the case where jSi = 0. In this case it is possible 
to have mi = 0. We will first consider this case. Since mi = 0, 
for all j = 1 , • • • , fe we have bj = bj . On the other hand, 
since u ^ u, there must exist an index j G 1, • • • ,k such that 
flj 7^ Uj. It can be shown that the total number of choices in 
this case is equal to 

lO'^-iS + lO'^-^S + lO'^-^S + • • • + 10'=-'"^3 
_ lo'=-"2(io™2 _ 1) 

~ 3 
if 7^ 1 and it is equal to 

lO'^-^S + lO'^-^S + lO'^-^S + • • • + lO'^-^M 



10'' 



if = 1. The next and the last case is where (3i = and 
mi ^ 0. In this case the argument is similar to the case where 
Pi = 1 (and hence mi ^ 0). The difference here is that we 
need to choose a subset L of Mi with an even cardinality I. 
For Z > the argument is similar to the case with Pi = 1 and 
for I = 0, the argument is similar to the case with Pi = and 
mi = 0. Therefore the number of choices in this case is equal 
to 



+ E 

1=2 

I even 



mi 



10 



if 6*^ 7^ 1 and it is equal to 

10fe-™2(io"2 +2) 



1=2 

even 



mi 
/ 



10 



if 9, - 1. 

Since the total number of choices is equal to 10*^" the asser- 
tion about the conditional probability in the lemma follows. 
For a fixed u € Dg, the number of u's in Dg such that 
mi{u,u) = mi and m2{u,u) = m2 is calculated as follows. 
Fix mi positions out of k positions ((^J choices) and in these 
positions let bj = bj + 1 and aj arbitrary (3™! choices). Fix 
m2 positions among the remaining k — m,i positions ( ( 



m2 



choices) and in these positions let bj — bj and choose aj ^ Uj 
(2™2 choices). It follows that the total number of choices for 
u is equal to 



nil, 1712, ma 
k \ fk — nil 
mi) \ m2 



. 3™! • 2'^ 



mi , . 

^(-i) = E ( 7 ) 9' 
1=1 ^ ^ 



I odd 



B{mi,m2) = 
C{mi,m2) = 



(10"2 +2) 
10™^ 

(10^2 _ 1) 

10"*2 



+ E 

1=2 

I even 

mi 

+ E 

1=2 

I even 



Using the above lemma and definitions, the expected value of 
the average probability of error can be upper bounded by: 

E{Pa^g(err)} 

k fc— mi n n—ni / \ / \ i 

mi— m2— rii— 712— 



I (x • {y, xy, x^yr-x {x, x^^-x {l}-"i-"2) n A^^{X\y) \ 
Define 

D{ni,n2;c,y) = 

I (c • {y, xy, x^yr^x{x, x^^^x {l}-"i-»2) n A^{X\y)\ 
This quantity can be upper boimded by 

I (c • {y, xy, x^yr^ x {l,x, a;^}"-"^ n A^,{X\y) \ 

and in turn we have the following lemma: 

Lemma IV.2. Let y € 3^" be an arbitrary channel output 
sequence. For any x G A'^{X\y), we have 

I (c • {y, xy, xST' x{\,x, a;^}"-"^ ) n A^,{X\y) \ 



< 



ni 



Where the random variable [X] takes value from the set of 
cosets of {\,x,x^} in Dg- 

Proof: First we prove the following: 



(c • {y, xy, x^yr^ x {l,x, x^}"""^ ) n A^{X) \ 

< fn\^^[H{X\[X])+0{e)] 



The cardinality in question is related to the conditional entropy 
of a random variable W jointly distributed with X which 
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satisfies the constraints of the following optimization problem: Therefore 

' gei>6wei>6 ^ III 

s.t. Using the equality li{{X\) = h{p) we get H{W\X) < 

E/ \ _ p f \ H{X\[X]) + h{a). Finally we use Stirling's approximation 

J2 p{g-w-\w)=Px{9) \{c-{y,xy,x''yr' x {l,a;,a;2}"-"^) nyl^(X)| 



The generalization to the statement of the lenuna is relatively 
straight forward and is omitted. 



E E Pix,w) = l-a 



9eT>ewe{y,xy,x^y} ^^^^^ ^^^^ S > 0, there ex- 

ists an integer Mi{d) such that for mi > Mi{S), 
where a = and the minimization is over all probability ^("ii), B(mi, ma), C(mi, ma) < It can also be 

massfunctionsonD6xD6.Letp = Px({l,a;,a;2}).Itcanbe shown that for all 6' > 0, there exists an integer M2(<5') 
shownthatif a > |l-2j5|, the following distribution satisfies such that for ma > M2{6'), A{mi) < ^°2{i-5')' 
the KKT conditions for this optimization problem. i3(mi, ma), C(mi, ma) < ''^'a(i-i5') ^ • 

For arbitrary 6, 6' > 0, we break the expected error probability 



1 -■R.j-r. , : , N r T i uito several terms as follows: 

^ g {y^xy,x^y} 

, w S {1, X, x^} 



^px(x)px(x.u;) ;«e{i,;y;xM E{P„,,(err)} = Pi(5)+ E P2{m„6') + P,{6,5') 



2p(l-p) 



The entropy of this joint pmf can be found to be equal to 
i/(X,W) = 2/f(X)+M?^i^,l^^,l^,l^) 



mi=0 

where 



-2ih(j)) k k — mi n n— rii / i \ / 1 

■ ■ ■ ■ ' /v \ I 1^ TTli 



where p = Pv-(|1, x, x^j) and h is the entropy fimction. Next ^ ^ ^ ^ Vmi M mo 

we prove that tor a > |1 — ip\. 



E EEE :)r:r •3--2"^^- 



2p-l + a l-2p + a l-a l-a,^ , , • -A(mi)"^i3(mi, m2)"-"^-"^C(mi, ma)"^ 



For a = this statement is trivial. Assume a 7^ 0. Note that 

2p-l + a l-2p + a l^a l-Q-,^ P2(mi,<5') = 



1 1 

(c-{2/,X2/,x2yPx{x,x2px{l}"-"^-"^)nA^(X|y)| 



2p - 1 + g 2p - 1 + g _ 1 - 2p + g 1 - 2p + g V V I I ( "' M 3'' 



1 - a 



m2=M^{S') ni=0 ?i2=0 



" \^.i^^(mi)"^B(mi,m2)"-"^-"^C(mi,m2)"= 



= — g 



1 ("ITTii ?T. Q?l 

2p-l-|-a, 2p-l + al-2p + g 77 o 

+ ^ |(c. {t/,xt/,xM"^xR^'rx{ir-"^-"^) n A^(X|y)| 



l-2p + g 

log hlogg 

la 



= a 



2p- 1 -l-g ■ 
M ^ ) - /i(a) 



log2[6(l-5)]--log2 6-if(X|y) 



log2[3(l -S)]-'^ log2 3 - H{X\ [X]Y)+0(e) 



(1 - g)log(l - g) + (1 - g) P3iS,S') is defined similar to Pi{S) except that the 

first summation runs from to Mi{S) — 1 and the second 
sunnmation runs from to M2(S') — 1. Next we show that 

- (1 - g) log(l - g) + (1 - g) 

f r k 
2p-l + Q ^ Pi(5) <exp2<^-n ' 

= ahi ) + (1 - g) + /i(g) L 

la 

Since the function /i( ) is convex, for two points Xi, X2 € [0, 1] ^"'^ 
and a number g e [0, 1] we have P2(mi 5') 

g/i(xi) + (1 — g)/i(xa) < h{axi + (1 — g)xa) f 

< exp2 < ^ 

Let xi = and xa = 5 to get I- 

2p — 1 + a 2p — 1 + a 1 — a note that P3{S, S') is independent of the rate R (It can be 

2g ^ + (1 ~ Q^) < ''■I 2 ' 2 ^ ~ ^^^^ shown that this term goes to zero as n increases regardless of 
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the value of R). We have 



For a fixed mi, consider 



k 

mi 



k — mi 

1712 



k 

mi 



fc — mi^ 



m2 



k—m-i n n—n-i 

= E E E E 

mi=Mi((5) m2=0 ni=0 n2=0 

^ • i^yl(mirS(mi,m2)"-"^-"^C(mi,m2r- 
\{c- {y,xy,x^yr'x{x,x^r-x{ir 

k k—mi n n— ni 

^ E E E E 

mi=Mi((5) rn2=0 ni=0 n2=0 

1 1 10™! 

I (c • {t/, x2y}"^x {x, x'r^x n 

^ s E"(:,)C:„r)^^--^"" (eaby'"- 

n n— rii 

E E 

m— n2— 

|(c-{y,a;2/,xM"^x{x,:c2px{l} 



n— ni — n2 



nA^(^l2/)| 



Note that the result of the last two simunations is simply equal 
to 



\A^{X\y)\ = 2"[^(^l^)+o(^)] 



Therefore, 



mi=Mi(5) "12=0 \ V 



< ( ^ \n . ^n[H{X\Y)+0{e)] 



K / J \ k—mi / 



tTl2=0 



/c — TOi 

m2 



Note that the result of the last summation is equal to s'*-™! 
and hence the result of the last two summations is equal to 
6*^. Therefore, 



Pi (5) < exp2 \ -n 



log2[6(l-,5)]--log2 6-if(X|r) 



k 

mi 



k — mi 
m2 



• 3™^ • 2™^ 



P2(mi,5') = 

k—m\ n n—ni 

E EE 

m2=M2((5') ni=0 712=0 

^ • ^^(mi)"^B(mi,m2)"-"-"^C(mi,m2)"^- 
I (c • {t/, x^yr^x{x, x^^^x n 

k—mi n n—rii 

< EE'"" ' 1 • 3'"' • 

TO2=M2(5') ni=0 n2=0 



k \ fk — rn i 
mi) \ m2 



1 



1 



2Qmi _ gmi 



10™1 + 8"! 



2(1-^0 

ni— "2 



2(1 

|(c-{y,xy,x2y}"ix{a;,a;2}"^x{l}" 



n^:'(x|2/)| 



1 1 




_8mr 


m 




n— m 




[ 2(1 


-5') J 




[ 2(1-5') J 





(c • {y, xy, x^yr^x{l, x, a;^}"-"^) n A^{X\y) \ 

k—mi 



^ E E 

TO2=M2(5') "1=0 



k 
mi 



k — mi 
m2 



1 1 


'lO"*! 


- 8"i' 




'10™! -\ 




n— m 




i 2(1 


-5') J 




[ 2(1- 


5') J 





ni/ 

k—rn 1 

= V 

m2=M2(5') 

1 1 2"['^(^IW^)+0(^)I. 



fc \ fk — mi 



2™i . 





'10™! 




"1 


'lO"! +8'"i' 


n—n\ 


C) 


[ 2(1 


-5') J 




[ 2(1-5') J 





k \ fk — nil 



n 

E 

ni=0 

k—7ni 

= E 

m2=M2(i5') 
1 1 
^Qmin 2^ 

2(1 - S') 2(1 - 5') 

< 2"[^(^l[^l'^)+°(^)l 

- (3(1-5'))" 

3ml ^ I I . 2^2 

m2=0 



mi/ \ m2 

2n[H(X|[X]r)+0(e)]. 



(3(1 



J*^ nn[H{X\[X]Y)+0{e)]f ^ \omink 

-5'))" UJ 
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Note that for a fixed mi 

< exp2 < ^n. 



) is a polynomial in k. Hence, 



log2[3(l - 6)]~^\og^3- H{xmY) + Oie) 



We observe that if R < logs [6(1 - S)] - H{X\Y) 
then Pi{S) goes to zero as n increases and if 



R 



< 



log 



log2 3 



{log^[3{l-S')]-H{X\[X]Y)} then 

X]mi=o~^ P2{'mi,5') vanishes as the block length increases. 
Therefore, If both conditions are satisfied the expected value 
of the block error probability goes to zero. In conclusion. 
The rate R is achievable if 

f R<\ogM^-S)]-H{X\Y) 



R < 



log, 6 



{log2 [3{1-S')]-H{X\[X]Y)} 



Since 6 and 6' are arbitrary, we conclude that the rate R* is 
achievable where 

R* min nog2 6 - HiX\Y), [log^ 3 - HiX\[X]Y)] 



V. Comparison With Abelian Group Codes 

The only Abelian group of size 6 is = {0, 1, • • • , 5} 
where the group operation is addition mod-6. The best 
achievale rate using abelian group codes over Zg is known 
to be [Uj 



R* =min (log^e - H{X\Y) 



log^G 



[log^3-H(X\[X]sY)] 



log2 3 

,log^6[l^H{X\[X]2Y)] 



where [X]^ takes values from cosets of {0, 2, 4} and [X]2 
takes values from cosets of {0, 3}. In the following example 
we show that the achievable rate using the new code can be 
strictly larger than the rate achievable using Abelian group 
codes. 

A. An Example 

We give an example where the capacity of group codes 
is zero whereas the constructed code achieves a strictly 
positive rate. Consider the channel depicted in the figure 
below where ei — 0.1, ei = 0.2 and ei = 0.15. If we 



Fig. 2: Channel: The first column on the left shows the input 
labels in Zg and the second column shows the labels in Dg. 




maximize over all possible labellings of the channel input 
alphabet, it can be shown that both coding schemes achieve 
the symmetric capacity of the channel which is equal to 
0.0139 bits per channel use. However, if the labels are as- 
sumed to be fixed, the achievable rate using pseudo-group 
codes is equal to R* = min(0.0139, 0.0227) = 0.0139 and 
the achievable rate using Abelian group codes is equal to 
R = min(0.0139, 0.0227, 0) = 0. Indeed using the converse 
provided in | ,14J we can show that the capacity of Abelian 
group codes over this channel is equal to zero. We observe 
that for this channel, the codes over Dq outperforms the code 
over Zg. 

B. Comparison 

If we compare the two achievable rate regions, we observe 
that for the case of abelian group codes there is an additional 
term in the minimization which can be explained by the 
additional structure of the abelian group codes. Indeed, the 
pseudo-group code over Dg is additive (homomorphic) with 
respect to the y generator and is not homomorphic with 
respect to the x generator whereas Abelian group codes are 
homomorphic with respect to both of their generators. This 
means compared to Abelian Group codes, the constructed 
codes gain a higher rate by reducing the structure. 

VI. Conclusion 

We have shown that good structured codes over non- Abelian 
groups do exist. We constructed codes over the smallest non- 
Abelian group Dg and showed that the performance of these 
codes is superior to the performance of Abelian group codes 
of the same alphabet size. 
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